End to End Deep Neural Networks Radio Receiver for Speech Signals
نویسندگان
چکیده
Frequency modulation (FM) is a form of radio broadcasting which is widely used nowadays and has been in use for over a century, since 1906 when the first radio audio broadcast was made. FM performance is severely degraded when the system noise amount exceeds a critical level, above which the traditional FM receiver breaks. Traditional methods used in radio receivers divide the problem into two parts – signal demodulation and speech enhancement. We suggest a software-defined-radio receiver that detects and enhances speech, by adopting an end-to-end learning based approach and utilizing the prior information of transmitted speech message in the demodulation process. The new system yields high performance detection for both acoustical disturbances and communication channel noise
منابع مشابه
Speech signals frequency modulation decoding via deep neural networks
End-to-end learning based approaches have been shown to be effective and are giving excellent performance for many systems with less training data. In this work we present an end to end learning approach for novel application of software defined radio (SDR) that utilizes the prior knowledge of transmitted speech message to detect and denoise the audio speech signal from the inphase and quadratu...
متن کاملA hybrid EEG-based emotion recognition approach using Wavelet Convolutional Neural Networks (WCNN) and support vector machine
Nowadays, deep learning and convolutional neural networks (CNNs) have become widespread tools in many biomedical engineering studies. CNN is an end-to-end tool which makes processing procedure integrated, but in some situations, this processing tool requires to be fused with machine learning methods to be more accurate. In this paper, a hybrid approach based on deep features extracted from Wave...
متن کاملشبکه عصبی پیچشی با پنجرههای قابل تطبیق برای بازشناسی گفتار
Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...
متن کاملSpeech Emotion Recognition Using Scalogram Based Deep Structure
Speech Emotion Recognition (SER) is an important part of speech-based Human-Computer Interface (HCI) applications. Previous SER methods rely on the extraction of features and training an appropriate classifier. However, most of those features can be affected by emotionally irrelevant factors such as gender, speaking styles and environment. Here, an SER method has been proposed based on a concat...
متن کاملEnd-to-end Phoneme Sequence Recognition using Convolutional Neural Networks
Most phoneme recognition state-of-the-art systems rely on a classical neural network classifiers, fed with highly tuned features, such as MFCC or PLP features. Recent advances in “deep learning” approaches questioned such systems, but while some attempts were made with simpler features such as spectrograms, stateof-the-art systems still rely on MFCCs. This might be viewed as a kind of failure f...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1704.02046 شماره
صفحات -
تاریخ انتشار 2017